Topic Tracking for Punjabi Language

Authors

  • Kamaldeep Kaur
  • Vishal Gupta
Abstract

This paper introduces topic tracking for the Punjabi language. Text mining is a field that automatically extracts previously unknown and useful information from unstructured textual data, and it has strong connections with natural language processing (NLP). NLP has produced technologies that teach computers natural language so that they can analyze, understand, and even generate text. Topic tracking is one such technology and can be used in the text mining process. Its main purpose is to identify and follow events presented in multiple news sources, including newswires, radio, and TV broadcasts. It gathers dispersed information and makes it easy for the user to get a general understanding. Little work has been done on topic tracking for Indian languages in general and Punjabi in particular. We first survey the available approaches to topic tracking, then present our approach for Punjabi, and report experimental results.

Similar resources

Sentiment Analysis on Punjabi News Articles Using SVM

Sentiment analysis is a field of Natural Language Processing and it is the most trending field of research. In the process of text mining that is used to find out people’s opinion about a particular product, topic and predicting market trends or outcomes of elections, detecting and classifying sentiments from the text. Sentiment analysis on Punjabi language is to be performed because of increas...

Punjabi Text Clustering by Sentence Structure Analysis

Punjabi Text Document Clustering is done by analyzing the sentence structure of similar documents sharing same topics and grouping them into clusters. The prevalent algorithms in this field utilize the vector space model which treats the documents as a bag of words. The meaning in natural language inherently depends on the word sequences which are overlooked and ignored while clustering. The cu...

Automatic Text Summarization System for Punjabi Language

This paper concentrates on single document multi news Punjabi extractive summarizer. Although lot of research is going on in field of multi document news summarization systems but not even a single paper was found in literature for single document multi news summarization for any language. It is first time that this system has been developed for Punjabi language and is available online at: http...

An Automatic Spontaneous Live Speech Recognition System for Punjabi Language Corpus

In a spontaneous Punjabi speech model, the speech is essentially unplanned and undesigned; it is generally characterized by repetitions, preservation, false starts, half-spoken words, unplanned words, silence gaps, etc. In a system of Punjabi speech detection including vocabulary, the identification needs the evaluation among the audio signal of the utterance and the variety of utterances of th...

Automatic Spontaneous Speech Recognition for Punjabi Language Interview Speech Corpus

Automatic Speech Recognition presents natural phenomena for the communication among man and machine. The purpose of Speech Recognition speech system is to convert the sequence of sound units in the form of text description. The main objective of the research work is to develop the automatic spontaneous speech model for the Punjabi language. Punjabi is categorized as a constituent of the Indo-Ar...

Publication date: 2011